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Response to Amendment 
Specification 

1 . Applicants' response to the last Office Action, filed December 21 , 2007 has been 
entered and made of record. 

2. The objection to the title as not descriptive has been withdrawn in light of the 
amended title. 

3. The objection to the drawings has been withdrawn in light of the amended 
drawing sheets. 

4. The amended specification has been entered and made of record. 

Response to Arguments 

5. Applicant's arguments, filed December 21 , 2007, with respect to the rejection(s) 
of claim(s) 1, 7, 9, 10, 11, 17, 19, and 20 under Nakamura (35 U.S.C. 102(b)) have 
been fully considered and are persuasive. Therefore, the rejection has been withdrawn. 
However, upon further consideration, a new ground(s) of rejection is made in view of 
Nakamura in view of Barski (35 U.S.C. 103(a)). 

DETAILED ACTION 

Claim Rejections - 35 USC § 103 

6. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 



(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
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invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

1. Claims 1,7-11, and 17-20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Osamu Nakamura et al., "Extraction Of Photographic Area From 
Document Images" (hereinafter "Nakamura"), further in view of Barski et al., US 
5,212,741 (hereinafter "Barski"). 

Regarding claims 1 0 and 20, which are representative of claims 1 , 7, 9, 1 1 , 1 7, 
and 19, Nakamura discloses a method and device for binarizing an image (see page 
80, section 3 where Nakamura discusses bi-level quantization of an image), 
comprising: an input part for receiving an image (figure 1 shows an input document 
image); a block classification part for dividing the received image into blocks (see page 
77, section 2.1(3) where Nakamura discusses segmenting the image into blocks), 
and classifying the divided blocks into character blocks and background blocks (see 
page 77-78, section 2.1(4) and 2.2 where Nakamura discusses classifying the 
blocks based on attributes); a block growing part for growing the classified blocks 
(page 79, section 2.3(2): blocks are expanded), and restoring a misclassified block 
(figure 6 shows blocks reclassified during the expansion process); a block 
grouping part for grouping a character block output from the block growing part with its 
neighboring blocks, thereby generating a grouped block (see page 79, section 2.3(2) : 
merging blocks that become adjacent); a block splitting part for separating the 
character block from the grouped block (see page 79, section 2.3(3) where Nakamura 
discusses dividing integrated areas into sub-blocks); and a binarization part for 
binarizing pixels of character blocks into a first brightness value for character pixels and 
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a second brightness value for background pixels by comparing the pixels of the 
character blocks with a threshold, and binarizing pixels of background blocks output 
from the block classification part into the second brightness value (see page 80, 
section 3, where Nakamura discusses the use of bi-level and notchless bi-level 
quantization of an image to threshold the image into a binary equivalent image). 

While Nakamura discloses the block integration process for use on blocks 
classified as pictures elements, it would have been obvious to apply this same 
procedure to the text classified blocks, as the same results of grouping similarly 
classified blocks can be expected. Nakamura discloses the use of an edge 
enhancement operation (page 77, section 2.1(1): a Sobel operator is used to better 
define edges in the image) for use in classification, but does not use this on the image 
after classification. Also, the binarization process in Nakamura does not use a 
threshold established during the edge enhancement stage. Barski discloses an edge 
enhancement part for enhancing edges of a character block using relations between 
neighboring pixels in the character block, (column 6-7, lines 63-4: a mask which uses 
surrounding pixel values is used to enhance characters of a text area); and 
generating a threshold for distinguishing character pixels and background pixels of the 
character block which is used in a binarization process (column 7, lines 20-28: a 
histogram of the edge enhanced image is used to calculate a threshold used in 
binarization). 

It would have been obvious to one skilled in the art at the time the invention was 
made to modify the invention of Nakamura, and modify the binarization process to 



Application/Control Number: 10/767,061 Page 5 

Art Unit: 2624 

include an edge enhancement portion, as taught by Barski, thus bringing out higher 
frequency line details and improving optical character recognition of text, as discussed 
by Barski (column 6, lines 65-66 and column 2, lines 40-41). 

Regarding claims 8 and 18, Nakamura discloses the method and device of 
claims 7 and 17, wherein the block growing part comprises: a dilation part for growing a 
character block and changing a block containing a character pixel, classified as a 
background block, to a character block (see page 79, section 2.3(2) where Nakamura 
discusses expanding blocks by one width on each side); and a closing part for 
eroding the dilated character block and deducting connected blocks (see page 79, 
section 2.2(5) where Nakamura discusses reclassifying blocks). Furthermore, the 
examiner takes official notice that the sequential process of dilation then erosion is 
known as closing and is known in the art to fill in small holes and gaps in pixel data. 

2. Claims 2, 3, 6, 12, 13, and 16 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Nakamura in view of Barski as applied to claims 1 and 1 1 above, and 
further in view of Shan Mo et al., "Adaptive, Quadratic Preprocessing Of Document 
Images For Binarization" (hereinafter "Mo"). 

Regarding claims 6 and 16, are representative of claims 2 and 12, Barski 
discloses an edge enhancement part in which any suitable edge enhancement 
technique can be used (column 7, lines 7-9). Mo discloses an edge enhancement 
technique comprising: a first threshold selection part for calculating a first threshold for 
classifying each pixel of the character block as a character pixel or a background pixel 
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(page 995, section IIA1, column 1: finding a global threshold level); a mean 
computation part for classifying pixels of the character block into character pixels and 
background pixels on the basis of the first threshold (page 995, section IIA1, column 
1: determining whether a pixel is a background or character pixel using the 
threshold), and calculating mean brightness values for character pixels and 
background pixels of the character block (page 995, section IIA1, column 2: calculate 
average gray levels for character and background pixels separately); a 
normalization part for normalizing the pixels of the character block using the mean 
brightness value for character pixels and the mean brightness value for background 
pixels output from the mean computation part so that the character pixels have a value 
close to T while the background pixels have a value close to '0' (page 995, section 
IIA1, column 2: normalize the data; column 1: with values close to zero and one); 
a quadratic operation part for performing a quadratic operation on the normalized 
character block to enhance edges of the character block and reduce noise of the 
character block (page 995, section IIA3: apply the quadratic filter discussed on 
page 994 to the normalized data); a denormalization part for denormalizing the 
quadratic-processed character block and providing the denormalized character block to 
the binarization part (page 995, section IIA4: the output of the quadratic filter 
denormalizes by mapping output values close to desired values for background 
and character pixels); and a second threshold selection part for calculating a second 
threshold for classifying each pixel of the denormalized character block as a character 
pixel or a background pixel, and outputting the second threshold as a threshold for the 
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binarization part (page 995, section IIA4: calculating a threshold for use in 
binarization). 

It would have been obvious to one skilled in the art at the time the invention was 
made to modify the invention of Nakamura and Barski, and modify the edge 
enhancement means to use a quadratic preprocessing approach, as taught by Mo, thus 
improving the quality of binarized images, as discussed by Mo (see abstract). 

Regarding claims 3 and 13, Mo discloses using quadratic filters to edge enhance 
images (see abstract of Mo). 

3. Claims 4, 5, 14, and 15 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Nakamura in view of Barski as applied to claims 1 and 1 1 above, and 
further in view of Ricardo L. de Queiroz et al., "Fast Segmentation Of The JPEG 
Compressed Documents" (hereinafter "de Queiroz"). 

Regarding claims 4 and 14, Nakamura discloses the method and device of 
claims 1 and 11, wherein the block classification part comprises an image division part 
for dividing the received image into blocks having a predetermined size (see page 77, 
section 2.1 where Nakamura discusses segmenting the image into blocks of 32 x 
32 pixels). Nakamura also discloses thresholding and classification of an image based 
up on average energy in an image (see page 78, section 2.2 where Nakamura 
discusses using the mean level of the background as a threshold in determining 
block attributes). Nakamura does not explicitly disclose the use of discrete cosine 
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transform conversion blocks and using the energy calculation of said blocks for region 
classification. 

de Queiroz discloses an image classification method including: a discrete cosine 
transform (DCT) conversion part for DCT-converting blocks output from the image 
division part (see page 368, section 2.1 where de Queiroz discusses JPEG 
compression involving the discrete cosine transform); an energy calculation part for 
calculating a sum of absolute values of dominant DCT coefficients within each of the 
DCT-converted blocks, and outputting the calculated sum as an energy value of the 
corresponding block (see page 369, section 2.2 where de Queiroz discusses 
computing the energy of each DCT block using the absolute value of the DCT 
coefficients). 

It would have been obvious to one skilled in the art at the time the invention was 
made to modify the invention of Nakamura, and modify the energy detection means to 
include DCT coefficient block energy, as taught by de Queiroz, thus allowing faster 
segmentation and discrimination of areas in a compressed image, as discussed by de 
Queiroz (see page 367, section 1.1). 

Regarding claims 5 and 15, de Queiroz discloses each of the blocks having a 
size of 8 x 8 pixels and a scalable summation for calculating the energy of each block 
(see pages 369-370, section 2.2 where de Queiroz discusses computing the AC 
energy of 64 pixel blocks). 
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